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Abstract 

We give a realization of the stable Levy forest of a given size conditioned by 
its mass from the path of the unconditioned forest. Then, we prove an invariance 
principle for this conditioned forest by considering k independent Galton- Watson 
trees whose offspring distribution is in the domain of attraction of any stable law 
conditioned on their total progeny to be equal to n. We prove that when n 
and k tend towards +00, under suitable rescaling, the associated coding random 
walk, the contour and height processes converge in law on the Skorokhod space 
respectively towards the "first passage bridge" of a stable Levy process with no 
negative jumps and its height process. 
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1 Introduction 



The purpose of this work is to study some remarkable properties of stable Levy forests 
of a given size conditioned by their mass. 

A Galton- Watson tree is the underlying family tree of a given Galton- Watson pro- 
cess with offspring distribution \i started with one ancestor. It is well-known that 
if fi is critical or subcritical, the Galton- Watson process is almost surely finite and 
therefore, so is the corresponding Galton- Watson tree. In this case, Galton- Watson 
trees can be coded by two different discrete real valued processes: the height process 
and the contour process whose definition is recalled here in section 2. Both processes 
describe the genealogical structure of the associated Galton- Watson process. They are 
not Markovian but can be written as functionals of a certain left-continuous random 
walk whose jump distribution depends on the offspring distribution \x. In a natural 
way, Galton- Watson forests are a finite or infinite collections of independent Galton- 
Watson trees. 

The definition of Levy trees bears upon on the continuous analogue of the height 
process of Galton- Watson trees introduced by Le Gall and Le Jan in [21] as a functional 
of a Levy process with no negative jumps. Our presentation owes a lot to the recent 
paper of Duquesne and Le Gall [11] , which uses the formalism of IR— trees to define 
Levy trees that were implicit in [5] , [5] and [21] • We may consider Levy trees as random 
variables taking values in the space of compact rooted IR-trees. In a recent paper of 
Evans, Pitman and Winter [13] , IR— trees are studied from the point of view of mesure 
theory. Informally an IR— tree is a metric space (T, d) such that for any two points a 
and a' in T there is a unique arc with endpoints a and a' and furthermore this arc is 
isometric to a compact interval of the real line. In [T3], the authors also established 
that the space T of equivalent classes of (rooted) compact real trees, endowed with the 
Gromov-Hausdorff metric, is a Polish space. This makes it very natural to consider 
random variables or even random processes taking values in the space T. In this work, 
we define Levy forests as Poisson point processes with values in the set of IR-trees 
whose characteristic measure is the law of the generic Levy tree. 

First, we are interested in the construction of Levy forests of a given size conditioned 
by their mass. Again, in the discrete setting this conditioning is easier to define; the 
conditioned Galton- Watson forest of size k and mass n is a collection of k independent 
Galton- Watson trees with total progeny equal to n. In section 4, we provide a definition 
of these notions for Levy forest. Then, in the stable case, we give a construction of the 
conditioned stable Levy forest of size s > and mass 1 by rescaling the unconditioned 
forest of a particular random mass. 

In [1], Aldous showed that the Brownian random tree (or continuum random tree) is 
the limit as n increases of a rescaled critical Galton- Watson tree conditioned to have n 
vertices whose offspring distribution has a finite variance. In particular, Aldous proved 
that the discrete height process converges on the Skorokhod space of cadlag paths to 
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the normalized Brownian excursion. Recently, Duquesne [8] extended such results to 
Galton- Watson trees whose offspring distribution is in the domain of attraction of a 
stable law with index a in (1, 2]. Then, Duquesne showed that the discrete height pro- 
cess of the Galton Watson tree conditioned to have a deterministic progeny, converges 
as this progeny tends to infinity on the Skorokhod space to the normalized excursion 
of the height process associated with the stable Levy process. 

The other main purpose of our work is to study this convergence in the case of a 
finite number of independent Galton- Watson trees, this number being an increasing 
function of the progeny. More specifically, in Section 5, we establish an invariance 
principle for the conditioned forest by considering k independent Galton- Watson trees 
whose offspring distribution is in the domain of attraction of any stable law conditioned 
on their total progeny to be equal to n. When n and k tend towards oo, under suit- 
able rescaling, the associated coding random walk, the contour and height processes 
converge in law on the space of Skorokhod towards the first passage bridge of a stable 
Levy process with no negative jumps and its height process. 

In section 2, we introduce conditioned Galton- Watson forests and their related 
coding first passage bridge, height process and contour process. Section 3 is devoted to 
recall the definitions of real trees and Levy trees and to state a number of important 
results related to these notions. 

2 Discrete trees and forests. 

In all the sequel, an element u of (W) n is written as u = (u\, . . . u n ) and we set \u\ = n. 
Let 

oo 

U= |J(N*) n , 

n=0 

where N* = {1,2,...} and by convention (N*)° = {0}. The concatenation of two 
elements of U, let us say u = (u\, . . . u n ) and v = (vx, . . . ,v m ) is denoted by uv = 
(ui, . . . u n , v i, . . . , v m ). A discrete rooted tree is an element r of the set U which satisfies: 

(i) G r, 

(ii) If v £ r and v = uj for some j G N*, then u G r. 

(iii) For every u G r, there exists a number k u (r) > 0, such that uj G r if and only if 
1 <j < K(r). 

In this definition, k u (r) represents the number of children of the vertex u. We denote 
by T the set of all rooted trees. The total cardinality of an element r G T will be 
denoted by C( r ); ( we emphasize that the root is counted in C( r ))- If t G T and u G r, 
then we define the shifted tree at the vertex u by 

6u{t) = {v eU : uv e t} . 
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We say that u e r is a leaf of r if fc u (r) = 0. 

Then we consider a probability measure /i on Z + , such that 

oo 

^2kn(k)<l and /i(l)<l. 

fc=0 

The law of the Galton- Watson tree with offspring distribution /i is the unique proba- 
bility measure Q M on T such that: 

(u) For every j > 1, with > 0, the shifted trees #i(t), . . . , ^ (r) are independent 
under the conditional distribution Q M ( • | k$ — j) and their conditional law is Q M . 

A Galton- Watson forest with offspring distribution /i is a finite or infinite sequence of 
independent Galton- Watson trees with offspring distribution //. It will be denoted by 
F = (jk). With a misuse of notation, we will denote by Q M the law on (T) N * of a 
Galton- Watson forest with offspring distribution /i. 

It is known that the G-W process associated to a G-W tree or forest does not code 
entirely its genealogy. In the aim of doing so, other (coding) real valued processes 
have been defined. Amongst such processes one can cite the contour process, the height 
process and the associated random walk which will be called here the coding walk and 
which is sometimes referred to as the Luckazievicks path. 

Definition 1. We denote by u T (0) = 0, u T (l) = 1, . . . , u T (( — 1) the elements of a tree 
t which are enumerated in the lexicographical order (when no confusion is possible, we 
will simply write u(n) foru T (n)). Let us denote by \u{n)\ the rank of the generation of 
a vertex u(n) G r. 

(1) The height function of a tree r is defined by 

n ^ H n (r) = \u(n)\, 0<n< £(t) - 1 . 

(2) The height function of a forest T = (r k ) is defined by 

n ^ H n (F) = H n _( < ( To)+ ... +C ( Tk _ 1 ))(T k ), 

if C(to) + • • • + Cfa-i) < n < C(r ) + • • • + C(r k ) - 1, 

for k > 1, and with the convention that C( r o) = 0. If there is a finite number of 
trees in the forest, say j, then we set H n (T) = 0, for n > C( T o) + • • • + C( r j)- 

For two vertices u and v of a tree r, the distance d T (u, v) is the number of edges of the 
unique elementary path from u to v. The height function may be presented in a natural 
way as the distance between the visited vertex and the root 0, i.e. H n (r) = d T (0, u[n)). 
Then we may check the following relation 

d T (u(n),u(m)) = H n {r) + H m (r) - 2# fe(n , m) (r) , (2.1) 
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where k{n,m) is the index of the last common ancestor of u(n) and u(m). It is not 
difficult to see that the height process of a tree (resp. a forest) allows us to recover the 
entire structure of this tree (resp. this forest). We say that it codes the genealogy of 
the tree or the forest. Although this process is natural and simple to define, its law is 
rather complicated to characterize. In particular, H is neither a Markov process nor a 
martingale. 

The contour process gives another characterization of the tree which is easier to 
visualize. We suppose that the tree is embedded in a half-plane in such a way that 
edges have length one. Informally, we imagine the motion of a particle that starts at 
time from the root of the tree and then explores the tree from the left to the right 
continuously along each edge of r at unit speed until all edges have been explored and 
the particle has come back to the root. Note that if u(n) is a leaf, then the particle 
goes to u(n + 1), taking the shortest way that consists first to move backward on the 
line of descent from u(n) to their last common ancestor u(n) A u(n + 1) and then to 
move forward along the single edge between u{n) A u(n + 1) to u(n + 1). Since it is 
clear that each edge will be crossed twice, the total time needed to explore the tree 
is 2(£(t) — 1). The value C t (r) of the contour function at time t G [0, 2(£(t) — 1)] 
is the distance (on the continuous tree) between the position of the particle at time 
s and the root. More precisely, let us denote by l\ < li < ■ ■ ■ < l v the p leaves of r 
listed in lexicographical order. The contour function (C t (r),0 < t < 2(£(t) — 1)) is 
the piecewise linear continuous path with slope equal to +1 or -1, that takes successive 
local extremes with values: 0, \l\ A Z2I, ■ ■ ■ \lp-i A l p \, \l p \ and 0. Then we set 
Ct(r) = 0, for t G [2(£(r) — 1), 2£(r)]. It is clear that C(r) codes the genealogy of r. 

The contour process for a forest T = (r k ) is the concatenation of the processes 
C(ti), . . . , C(r k ), . . . , i.e. for k > 1: 

C t (F) = C t - 2{c{To)+ ... +c{Tk _ l)) (T k ), if 2(C(r ) + - • -+C(t*-i)) < * < 2(C(r ) + - • - + C(r fc )). 

If there is a finite number of trees, say j, in the forest, we set C^J 7 ) = 0, for t > 
2(CCnO + • • • + Cfa))- Note that for each tree r k , [2(C(r fc ) - l),2C(r fc )] is the only 
non-trivial subinterval of [0, 2(^(r k )] on which C(r k ) vanishes. This convention ensures 
that the contour process C(F) also codes the genealogy of the forest. However, it has 
no "good properties" in law either. 

In order to define a coding process whose law can easily be described, most of the 
authors introduce the coding random walk S{r) which is defined as follows: 

S = , S n+1 (r) - S n (r) = k u{n) (r) - 1, < n < C(r) - 1 . 

Here again it is not very difficult to see that the process S{r) codes the genealogy of 
the tree r. However, its construction requires a little bit more care than this of H(t) or 
C{t). For each n, S n (r) is the sum of all the younger brother of each of the ancestor of 
u(n) including u(n) itself. For a forest T = (r fc ), the process S'(JF) is the concatenation 
of S(n),...,S(T k ),...: 

S n (F) = 5'n-(C(ro)+-+C(Tfc-i))( T "fe) -k+1, 

if C(r ) + • • • + Cfa-i) < n < C(ro) + • • • + C(r k ). 
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If there is a finite number of trees j, then we set S^J 7 ) = S^ TQ ) + ... + ^ Tj )(J r ), for 
n > C( r o) + " " ' + C( r j)- From the construction of S(ti) it appears that S^Tx) is a random 
walk with initial value S = and step distribution v{k) = jj,{k + 1), k — —1, 0, 1, . . . 
which is killed when it first enters into the negative half-line. Hence, when the num- 
ber of trees is infinite, S^JF) is a downward skip free random walk on Z with the law 
described above. 
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Figure 1 



Let us denote H^J 7 ), C(JF) and respectively by H, C and S when no confusion is 
possible. In the sequel, we will have to use some path relationships between H, C and 
S which we recall now. Let us suppose that T is infinite. It is established for instance 
in that 

H n = card {0 < k < n - 1 : S k = inf SA . (2.2) 

k<j<n 

This identity means that the height process at each time n can be interpreted as the 
amount of time that the random walk S spends at its future minimum before n. The 
following relationship between H and C is stated in [9]: set K n = 2n — H n , then 

/ (H n -(t-K n ))+, ifte[K n ,K n+1 -l] 
1 \ (H n+1 - (K n+1 - t))+ , if [K n+1 - 1, K n+1 ] . [Z - 6) 

For any integer k > 1, we denote by T k ' n a G-W forest with k trees conditioned to have 
n vertices, that is a forest with the same law as T = (r 1; . . . , r^) under the conditional 
law Q M ( • | C( r i) + • • • + C( r fc) = n). The starting point of our work is the observation 
that J rh,n can be coded by a downward skip free random walk conditioned to first reach 
—k at time n. An interpretation of this result may be found in [22J, Lemma 6.3 for 
instance. 
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Proposition 1. Let T = (tj) be an infinite forest with offspring distribution fi and S, 
H and C be respectively its coding walk, its height process and its contour process. Let 
W be a random walk defined on a probability space P) with the same law as S. 

We define Tf 1 = inf{j : Wj = —i}, fori > 1. Take k and n such that P{T k w = n) > 0. 
Then under the conditional law Q M ( ■ | C( r i) + ' ' ' + C( r A;) = n ), 

(1) The process (Sj, < j < C( r i) + • • • + C( r fc)) has the same law as (Wj, < j < 

Moreover, define the processes = card {k G {0, . . . ,n — 1} : W^ = ini k <j< n Wj} 
and C w using the height process H w as in (\2.3\) . then 

(2) the process (Hj, < j < C( r i) + ' ' ' + C( r fc)) has the same law as the process 
(^f,0<j<Tf). 

(3) the process (C t , < < t < 2(((ti) + • • • + C( r fc))) has the same law as the 
process (Of, < t < 2T^). 

It is also straightforward that the identities in law involving separately the processes 
H, S and C in the above proposition also hold for the triple (H, S, C). In the figure 
below, we have represented an occurrence of the forest T k,n and its associated coding 
first passage bridge. 



2 3 






V 



t~2 r 3 

Conditioned random forest: k trees, n vertices. 



Coding first passage bridge 
Figure 2 



In Section HI we will present a continuous time version of this result, but before we 
need to introduce the continuous time setting of Levy trees and forests. 
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3 Coding real trees and forests 



Discrete trees may be considered in an obvious way as compact metric spaces with 
no loops. Such metric spaces are special cases of IR-trees which are defined hereafter. 
Similarly to the discrete case, an IR-forest is any collection of IR-trees. In this section 
we keep the same notations as in Duquesne and Le Gall's articles [9] and [TT]. The 
following formal definition of IR-trees is now classical and originates from T-theory. It 
may be found for instance in [7]. 

Definition 2. A metric space (T,d) is an IR-tree if for every <J\,oi G T, 

1. There is a unique map f ai .a 2 from [0, d(a\, a^)] into T such that /o- li(T2 (0) = o\ 
and / CTliCT2 (d(ai,(72)) = cr 2 . 

2. If g is a continuous infective map from [0, 1] into T such that g(0) = o\ and 
g(l) = o- 2 , we have 

g([0,l]) = fr 1 , a2 ([0,d(a 1 ,a 2 )}). 

A rooted IR-tree is an IR-tree (T, d) with a distinguished vertex p = p{T) called the 
root. An IR-forest is any collection of rooted IR-trees: T = {(%,, di), i G I}. 

A construction of some particular cases of such metric spaces has been given by Aldous 
PQ and is described in [11] in a more general setting. Let / : [0, oo) — > [0, oo) be a 
continuous function with compact support, such that /(0) = 0. For < s < t, we 
define 

d f (s,t) = f(s) + f(t)-2 MJ(u) (3.4) 

and the equivalence relation by 

s ~ t if and only if df(s, t) = . 

(Note that df(s,t) = if and only if f(s) = fit) = inf uG [ s>t ] f(u).) Then the projection 
of df on the quotient space 

T / = [0,oo)/~ 

defines a distance. This distance will also be denoted by df. 
Theorem 1. The metric space (Tf,df) is a compact IR-tree. 

Denote by pf : [0, oo) — > Tf the canonical projection. The vertex p = p/(0) will be 
chosen as the root of Tf. It has recently been proved by Duquesne [8] that any IR-tree 
(satisfying some rather weak assumptions) may by represented as (Tf,df) where / is 
a left continuous function with right limits and without positive jumps. 

When no confusion is possible with the discrete case, the space of IR-trees will also 
be denoted by T. It is endowed with the Gromov-Hausdorff distance, d GH which we 
briefly recall now. For a metric space (E, 5) and K, K' two subspaces of E, 5 najVLS (K, K 1 ) 
will denote the Hausdorff distance between K and K' . Then we define the distance 
between T and T' by: 

d GH (T,T>) = inf (S Haus (^TW(T')) y5(ip(p),if\p'))) , 
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where the infimum is taken over all isometric embeddings ip : T — > E and ip' :T' — > E 
of T and T into a common metric space (E, 5). We refer to Chapter 3 of Evans [12] and 
the references therein for a complete description of the Gromov-Hausdorff topology. It 
is important to note that the space (T, (1gh) is complete and separable, see for instance 
Theorem 3.23 of [12] or [13]. 

In the remainder of this section, we will recall from [11] the definition of Levy 
trees and we state this of Levy forests. Let (f x ), x G IR be a sequence of probability 
measures on the Skorokhod space D of cadlag paths from [0, oo) to IR such that for 
each x G IR, the canonical process X is a Levy process with no negative jumps. Set 
P = Pq, so F x is the law of X + x under P. We suppose that the characteristic exponent 
ip of X (i.e. E(e _AX ') = e*^ (A) , A G IR) satisfies the following condition: 

du 

— -<oo. (3.5) 

ip(u) 

By analogy with the discrete case, the continuous time height process H is the measure 
(in a sense which is to be defined) of the set {s < t : X s = inf s < r <fX r }. A rigorous 
meaning to this measure is given by the following result due to Le Jan and Le Gall 
[2T] . see also [9]. Define J| = mi s < u < t X u . There is a sequence of positive real numbers 
(gfc) which decreases to such that for any t, the limit 
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H t (d = — / l{ Xs -ii<e k } ds (3.6) 



exists a.s. It is also proved in [21] that under assumption (I3.5p . H is a continuous 
process, so that each of its positive excursion codes a real tree in the sense of Aldous. 
We easily deduce from this definition that the height process H is a functional of the 
Levy process reflected at its minimum, i.e. X — I, where / := J°. In particular, when 
a — 2, H is equal in law to the reflected process multiplied by a constant. It is well 
known that X — I is a strong Markov process. Moreover, under our assumptions, is 
regular for itself for this process and we can check that the process —I is a local time 
at level 0. We denote by iV the corresponding Ito measure of the excursions away from 
0. 

In order to define the Levy forest, we need to introduce the local times of the 
height process H. It is proved in [9] that for any level a > 0, there exists a continuous 
increasing process (L", t > 0) which is defined by the approximation: 



limE I sup 

e|0 \0<s<t 



0. (3.7) 



The support of the measure dL^ is contained in the set {t > : H t = a} and we readily 
check that L° = —I. Then we may define the Poisson point process of the excursions 
away from of the process H as follows. Let T u = inf{£ : —It > u} be the right 
continuous inverse of the local time at of the reflected process X — I (or equivalently 
of H). The time T u corresponds to the first passage time of X bellow —u. Set T _ = 
and for all u > 0, 



(v) 



H Tu .+v , if < v < T u - T„_ 
, if v > T u - T u _ 
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For each u > 0, we may define the tree (T eu ,d £u ) under P as in the beginning of 
this section. We easily deduce from the Markov property of X — I that under the 
probability measure P, the process {(% u , d £u ),u > 0} is a Poisson point process whose 
characteristic measure is the law of the random real tree (Tg, dg) under N. By analogy 
to the discrete case, this Poisson point process, as a T- valued process, provides a natural 
definition for the Levy forest. 

Definition 3. The Levy tree is the real tree (Tg, dg) coded by the function H under 
the measure N . We denote by Q(dT) the a-finite measure on T which is the law of the 
Levy tree Tg under N . The Levy forest Tg is the Poisson point process 

(Tg(u),u>0) ^ {(% K ,d eu ),u>0} 

which has for characteristic measure Q(dT) under P. For each s > 0, the process 
Tg {d = {(T Cu , d eu ), < u < s} under P will be called the Levy forest of size s. 

Such a definition of a Levy forest has already been introduced in |22j, Proposition 7.8 
in the Brownian setting. However in this work, it is observed that the Brownian forest 
may also simply be defined as the real tree coded by the function H under law P. We 
also refer to |23j where the Brownian forest is understood in this way. Similarly, the 
Levy forest with size s may be defined as the compact real tree coded by the continuous 
function with compact support (H u , < u < T s ) under law P. These definitions are 
more natural when considering convergence of sequences of real forests and we will 
make appeal to them in section 5, see Corollary [TJ 

We will simply denote the Levy tree and the Levy forest respectively by Tg, Tg or 
Tg, the corresponding distances being implicit. When X is stable, condition (13.51) is 
satisfied if and only if its index a satisfies a £ (1,2). We may check, as a consequence 
of (13.61) . that H is a self-similar process with index a/ (a — 1), i.e.: 

(H t , t > 0) = {k (a ~ 1)/a H kt , t>0), for all k > 0. 

In this case, the Levy tree Tg associated to the stable mechanism is called the a-stable 
Levy tree and its law is denoted by Q a (dT). This random metric space also inherits 
from X a scaling property which may be stated as follows: for any a > 0, we denote 
by aTg the Levy tree Tg endowed with the distance adg, i.e. 

aTg (d = (Tg, adg) . (3.8) 

Then the law of aTg under Q a (dT) is a'^Q^dT). This property is stated in [20] 
Proposition 4.3 and [TO] where other fractal properties of stable trees are considered. 
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4 Construction of the conditioned Levy forest 



In this section we present the continuous analogue of the forest T k,n introduced in 
section 2. In particular, we define the total mass of the Levy forest of a given size 
s. Then we define the Levy forest of size s conditioned by its total mass. In the sta- 
ble case, we give a construction of this conditioned forest from the unconditioned forest. 

We begin with the definition of the measure £ a ' u which represents a local time at 
level a > for the Levy tree % u . For all a > 0, u > and for every bounded and 
continuous function f on T £u , the finite measure £ a,u is defined by: 

pT u —T u _ 

<r >u M= / dL a Tu+v y(p eu {v)) , (4.9) 

Jo 

where we recall from the previous section that p Cu is the canonical projection from 
[0, oo) onto % u for the equivalence relation ~ and (L£) is the local time at level a of 
H . Then the mass measure of the Levy tree T Su is 

/•oo 

m u = / dal a ' u (4.10) 
Jo 

and the total mass of the tree is m u (% u ). Now we fix s > 0; the total mass of the 
forest of size s, T S R is naturally given by 

M s = m u{T eu ). 

0<u<s 

Proposition 2. ¥ -almost surely T s = M s . 

Proof. It follows from the definitions (14.91) and (14.101) that for each tree T £u , the mass 
measure m u coincides with the image of the Lebesgue measure on [0, T u — T u _] under 
the mapping v i— > p eu {v). Thus, the total mass m u {T eu ) of each tree T £u is T u — T u _. 
This implies the result. ■ 



Then we will construct processes which encode the genealogy of the Levy forest of size 
s conditioned to have a mass equal to t > 0. From the analogy with the discrete case 
in Proposition [TJ the natural candidates may informally be defined as: 

X br ( = f) [(X u , 0<u<T s )\T s = t] 
H br ( = f) [(H u ,0<u<T s )\T s = t}. 

When X is the Brownian motion, the process X br is called the first passage bridge, 
see In order to give a proper definition in the general case, we need the additional 
assumption: 

The semigroup of (X, P) is absolutely continuous with respect to the Lebesgue measure. 

Then denote by p t (-) the density of the semigroup of X, by Q% ( = ] a{X v , v<u},u>0 
the ex-field generated by X and set pt(x) = p t (—x). 
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Lemma 1. The probability measure which is defined on each by 

WY* tz \ \-w(f[ t(s + X u ) p t ^ u (s + X u ) \ x 

V s(t-u) pt{s) J 

(4.H) 

is a regular version of the conditional law of (X u , < u < T s ) given T s = t, in the 
sense that for all u > 0, for X-a.e. s > and X-a.e. t > u, 

¥(X br G A u ) = limP(X e A u \ \T S - t\ < e) , 
where A is the Lebesgue measure. 

Proof. Let u < t, A u £ and e <t — u. From the Markov property, we may write 

r( xz K \\T,- t \<e) = E( 1{X6A , } p( ^-;j<;> g) ) 

On the other hand, from Corollary VII. 3 in [2] one has, 

tP(T s G rft) ds = sp t (s) dt ds . (4.13) 
Hence, for all x e IR, for all u > 0, for A-a.e. s > and A-a.e. t > u, 

¥ X (\T 8 - (t - u)\ <e) = t(g + x)p t „ M ( g + x) 
™ P(|T a -t|<e) s(t-u) p t (s) 

Moreover we can check from fTCT3|) that E ^'^^"^ < +oo for A-a.e. t, so 

the result follows from (14.121) and Fatou's lemma. ■ 



We may now construct a height process H from the path of the first passage bridge 
X br exactly as H is constructed from X in (13. 6p or in Definition 1.2.1 of [9] and check 
that the law of H br is a regular version of the conditional law of (H u , < u < T s ) given 
T s = t. Call (e*'*, < u < s) the excursion process of H br , that is in particular 

(e*'*, < u < s) has the same law as (e u , < u < s) given T s = t . 

The following proposition is a straightforward consequence of the above definition and 
Proposition [2j 

Proposition 3. The law of the process {{T e s,t^d e s,t) 1 < v < s} is a regular version 
of the law of the forest of size s, J-g given M s = t. 

We will denote by {Fj(u), < u < s) a process with values in T whose law under P 
is this of the Levy forest of size s conditioned by M s = t, i.e. conditioned to have a 
mass equal to t. 
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In the remainder of this section, we will consider the case when the driving Levy 
process is stable. We suppose that its index a belongs to (1,2] so that condition (13.51) 
is satisfied. We will give a pathwise construction of the processes (X br , H br ) from the 
path of the original processes (X,H). This result leads to the following realization 
of the Levy forest of size s conditioned by its mass. From now on, with no loss of 
generality, we suppose that t — 1. 

Theorem 2. Define g = sup{u < 1 : T u i/ a — s ■ u}. 

(1) IP '-almost surely, 

0<g<l. 

(2) Under F, the rescaled process 

{g {1 - a)/a H{gu), < u < 1) (4.14) 
has the same law as H br and is independent of g. 

(3) The forest T s ^ of size s and mass 1 may be constructed from the rescaled process 
defined in ( \4-14h i-e. if we denote by u i— > e u = {g^ a ^ a e u {gv),v > 0) its 
process of excursions away from 0, then under P, JFJ^ 1 = {(T eu , d eu ), < u < s}. 

Proof. The process T u = inf{t> : I v < — u] is a stable subordinator with index 1/a. 
Therefore, 

T u < su a , i.o. as u [ and T u > su a , i.o. as u j 0. 

Indeed, if u n | then P(T Un < s<) = P(T X < s) > 0, so that P(limsup n {T Un < 
su n}) — P(Ti < s) > 0. But T satisfies Blumenthal 0-1 law, so this probability is 1. 
The same arguments prove that P(limsup n {T Un > sw"}) = 1 for any sequence u n j 0. 
Since T has only positive jumps, we deduce that T u = su a infinitely often as u tends 
to 0, so we have proved the first part of the theorem. 

The rest of the proof is a consequence of the following lemma. 

Lemma 2. The first passage bridge X br enjoys the following path construction: 

X br ®(g- 1 ' a X(gu),0<u<l). 

Moreover, the process {g~ l ^ a X{gu)^ < u < 1) is independent of g. 

Proof. First note that for any t > the bivariate random variable (X t , I t ) under P is 
absolutely continuous with respect to the Lebesgue measure and there is a version of 
its density which is continuous. Indeed from the Markov property and (14.131) . one has 
for all x G IR and y > 0, 

iv, <»i*=*) = e (%,, /'-;<;- a) ) 

Jo s Pt{0) 
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Looking at the expressions of pt(x) and Pt{x) obtained from the Fourier inverse of the 
characteristic exponent of X and —X respectively, we see that theses functions are 
continuously different iable and that their derivatives are continuous in t. It allows us 
to conclude. 

Now let us consider the two dimensional self-similar strong Markov process Y ( = f> 
(X,I) with state space {(x, y) £ IR 2 : y < x}. ^From our preceding remark, the 
semi-group qt((x, y), (dx', dy')) = F(X t + x £ dx', y A (I t + x) £ dy') of Y is absolutely 
continuous with respect to the Lebesgue measure and there is a version of its density 
which is continuous. Denote by q t ((x,y), (x',y')) this version. We derive from (14.131) 
that for all — s < x, 

Qt({x, y), (s, -s)) = ^{ y >-s}jPt(s + x) . (4.15) 

Then we may apply a result due to Fitzsimmons, Pitman and Yor [15] which asserts 
that the inhomogenous Markov process on [0, £], whose law is defined by 

E( B( n, g <«)^;^ff ) in-(», y )), o <««, ( 4.i 6) 

where if is a measurable functional on C([0, u], IR 2 ), is a regular version of the condi- 
tional law of (Y v , < v < t) given Y t = (x',y'), under P( • | Yq = (x,y)). This law is 
called the law of the bridge of Y from (x, y) to (x', y') with length t. Then from (14.151) . 
the law which is defined in (14.161) . when specifying it on the first coordinate and for 
(x, y) = (0, 0) and (x', y') = (— s, —s), corresponds to the law of the first passage bridge 
which is defined in (14.111) . 

It remains to apply another result which may also be found in [T5]: observe that 
g is a backward time for Y in the sense of [TS]. Indeed g we may check that g = 
sup{u < 1 : X u = -su 1 /", X u = /„}, so that for all u > 0, {g > u} £ cr(Y v : v > u). 
Then from Corollary 3 in |15j . conditionally on g, the process (Y u , < u < g) under 
P( • | Y — (0, 0)) has the law of a bridge from (0, 0) to Y g with length g. (This result 
has been obtained and studied in a greater generality in [B].) But from the definition of 
g, we have Y g = (—sg 1 ' 01 , —sg 1 ^), so from the self-similarity of Y, under P the process 

(g- 1/a Y(g-u),0<u<l) 

has the law of the bridge of Y from (0, 0) to (— s, —s) with length 1. The lemma follows 
by specifying this result on the first coordinate. ■ 

The second part of the theorem is a consequence of Lemma HI the construction of H 
from X^ and the scaling property of H . The third part follows from the definition of 
the conditioned forest Tp} in Proposition [2] and the second part of this theorem. ■ 
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5 Invariance principles 



We know from Lamperti that the only possible limits of sequences of re-scaled G-W 
processes are continuous state branching processes. Then a question which arises is: 
when can we say that the whole genealogy of the tree or the forest converges ? In 
particular, do the height process, the contour process and the coding walk converge 
after a suitable re-scaling ? This question has now been completely solved by Duquesne 
and Le Gall [9]. Then one may ask the same for the trees or forests conditioned by 
their mass. In [8J, Duquesne proved that when the law v is in the domain of attraction 
of a stable law, the height process, the contour process and the coding excursion of the 
corresponding G-W tree converge in law in the Skorokhod space of cadlag paths. This 
work generalizes Aldous' result which concerns the Brownian case. In this section 
we will prove that in the stable case, an invariance principle also holds for sequences 
of G-W forests conditioned by their mass. 

Recall from section [2] that for an offspring distribution // we have set v{k) = fi(k+l), 
for k — — 1,0,1, We make the following assumption: 

{/i is aperiodic and there is an increasing sequence (a n ) n > 
such that a n — * +oo and S n /a n converges in law asn^ +oo 
toward the law of a non-degenerated r.v. 9. 

Note that we are necessarily in the critical case, i.e. kfi(k) = 1, and that the law 
of 9 is stable. Moreover, since u(— oo, —1) = 0, the support of the Levy measure of 9 is 
[0, oo) and its index a is such that 1 < a < 2. Also (a n ) is a regularly varying sequence 
with index a. Under hypothesis (H), it has been proved by Grimvall [16] that if Z is 
the G-W process associated to a tree or a forest with offspring distribution /z, then 

(—Z [nt/an] , t > j => (Z t , t > 0) , as n -> +oo, 

where (Z t , t > 0) is a continuous state branching process. Here and in the sequel, 
will stand for the weak convergence in the Skorokhod space of cadlag trajectories. 
Recall from section 2 the definition of the discrete process (S,H,C). Under the same 
hypothesis, Duquesne and Le Gall have proved in [9], Corollary 2.5.1 that 



— S[nt], — H[nth — C 2n t I , t > 

a„ n n 



[{X t ,H t ,H t ), t >0)] , asn^+oo, (5.17) 



where X is a stable Levy process with law 9 and H is the associated height process, as 
defined in section 3. 

Again we fix a real s > and we consider a sequence of positive integers (k n ) such 
that 

k 

— — > s , as n — > +oo. (5.18) 

a n 

Recall the notations of section 2. For any n > 1, let (X br ' n , H br ' n , C br,n ) be the process 
whose law is this of 



( S[nth —Hl n t}, — C 2n t J , < t < 1 

\a n n n J 
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under Q At (- | C( r i) + "'' + C( T k n ) = n). Note that we could also define this three 
dimensional process over the whole halfline [0, oo), rather than on [0,1]. However, 
from the definitions in section 2, H br,n and C br,n would simply vanish over [1, oo) and 
X br,n would be constant. Here is the conditional version of the invariance principle 
that we have recalled in (I5.17p . 

Theorem 3. As n tends to +oo ; we have 

^j£br,n j^br,n ^jbr,n^ (J^^ )r fj br H br \ 

In order to give a sense to the convergence of the Levy forest, we may consider the 
trees T br,n and T br which are coded respectively by the continuous processes with 
compact support, C br ' n and H br , in the sense given at the beginning of section 3 (here 
we suppose that these processes are defined on [l,oo) and both equal to on this 
interval). Roughly speaking the trees T br ' n and T br are obtained from the original 
(conditioned) forests by rooting all the trees of these forests at a same root. 

Corollary 1. The sequence of trees T br ' n converges weakly in the space T endowed with 
the Gromov- Haus dor ff topology towards T br . 

Proof. This results is a consequence of the weak convergence of the contour function 
C br ' n toward H br and the inequality 

d GH (T g ,T g ,) < 2\\g - g'\\ , 

which is proved in [TT], see Lemma 2.3. (We recall that dcH the Gromov-Hausdorff 
distance which has been defined in section 3.) ■ 



A first step for the proof of Theorem [3] is to obtain the weak convergence of (X br ' n , H br,n ) 
restricted to the Skorokhod space D([0, t}) for any t < 1. Then we will derive the 
convergence on D([0, 1]) from an argument of cyclic exchangeability. The convergence 
of the third coordinate C br,n is a consequence of its particular expression as a functional 
of the process H br,n . In the remainder of the proof, we suppose that S is defined on 
the same probability space as X and has step distribution v under P. Define also 
T k = inf{2 : Si = —k}, for all integers k > 0. Hence the process (X br > n , H^ n } C br ' n ) 
has the same law as 

( S[ nt ], —H[ nt ], — C 2n t J , < t < 1 , 

\a n n n J 

under the conditional probability P( ■ | T kn = n). 

Lemma 3. For any t < 1, as n tends to +oo, we have 

K r '", H^), 0<u<t}^ [(X b J, 2%), < y < t] . 
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Proof. From Feller's combinatorial lemma, see |14j . we have P(T^ = n) = -¥(S n = —k), 
for all n > 1, k > 0. Let F be any bounded and continuous functional on D([0, £]). By 
the Markov property at time [nt], 



E[F(Xf n , iC' n ; < u < t)} = E 



S[nu], —H[ nu ]] < U < t \ | 7/, // 

a„ n 



w fir V s [nt] (T kn = n-[nt]) (1 a 

= E l {[nt]<T } — r x F —S [nu] , —H [nu] ; 0<u<t 

V ™(^fc„ = n ) \ a n n 

( n(k n + S [nt] ) P S[ „ t] (gn-[nt] = -fcn) 

xF (—S[ nu ], —H[ nu y, <u <t 

where S_ k = infj<fc Si. To simplify the computations in the remainder of this proof, we 

set P( n ) for the law of the process (^-S[ nu ], ^H[ nu y,u > CM and P will stand for the 

law of the process (X u , H u ; u > 0). Then Y = (Y 1 , Y 2 ) is the canonical process of the 
coordinates on the Skorokhod space D 2 of cadlag paths from [0, oo) into IR 2 . We will 
also use special notations for the densities introduced in (14. lip and (15. 19ft : 

n it S + Y t P^Y^ + s) 

- ^-> a (i-t) W) ' and 



(n) _ ,, n^nT^i^] 



D K! = V 1 > 



U>~^ k n (n-[nt\) F(S n = -k n ) 

where Yj = inf u < t Y^ 1 . Put also F t for F(Y U , < u < t). To obtain our result, we have 
to prove that 

lim |£ (n) (F t A (n) ) - E{F t D t )\ = . (5.20) 

n— ++oo 

Let M > and set Im{x) ( = ] TL[- St M](x). By writing 

E {n) (F t D { t n) ) = EM(F t Dl n) I M (Y?)) + E^(F t D^(l - I M (Y?)) 

and by doing the same for E(F t D t ), we have the following upper bound for the term 
in ([ODD 

\ E M(FtDl n) ) - E(F t D t )\ < | E (n) (F t D^ I M ( Y/ ) ) - £(F 4 ZVm(^))| 

+C£ ;(n)p(n) (l _ j^l))) + C E(D t (l - I M {Y?))) , 

where C is an upper bound for the functional F. But since D t and £> t (n) are densities, 

£(n)p(")) = x and £(£> t ) = 1; hence 

| E (n) (FtA W ) _ jE;(FtA) | < ^(^(F^/m^ 1 ))-^^^/^ 1 ))! (5.21) 

+C[1 - EV>{Dfl M <y}))\ + <7[1 - £?(A/m(^))] • 
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Now it remains to prove that the first term of the right hand side of the inequality 
(PEHD tends to 0, i.e. 

\E^\F t D { ; i) I M {Y t 1 )) - E(F t D t I M {Yl))\ - 0, (5.22) 

as n — > +oo. Indeed, suppose that (15.221) holds, then by taking F t = 1, we see that the 
second term of the right hand side of (15.211) converges towards the third one. Moreover, 
E(D t lM(Yt)) tends to 1 as M goes to +oo. Therefore the second and the third terms 
in (I5.2ip tend to as n and M go to +oo. 

Let us prove (I5.22p . From the triangle inequality and the expression of the densities 
D t and D\ , we have 

\E^(F t Dl n) I M (Y t 1 )) - E(F t D t I M (Y^))\ < sup \g n {x) - g(x)\ + 

xE[-s,M] 

\E^ n \F t D t I M {Yl)) - E(F t D t I M (Y t 1 )) \ , (5.23) 

where g n (x) = gfegy^fegg^ and g(x) = ^^fef But thanks to Gne- 
denko local limit theorem and the fact that k n /a n — > s, we have 

lim sup \g n (x) - g(x)\ = 0. 
n ^+°° xe{-s,M] 

Moreover, recall that from Corollary 2.5.1 of Duquesne and Le Gall [9], 

p(n) ^ p ^ 

asn-> +oo, where ^> stands for the weak convergence of measures on D 2 . Finally, note 
that the discontinuity set of the functional F t D t lM{Xt) is negligible for the probability 
measure P so that the last term in ( 15. 23ft tends to as n goes to +oo. ■ 

Then we will prove the tightness of the sequence, (X br,n , H br,n ) . Define the height 
process associated to any downward skip free chain x = (x , X\, . . . , x n , . . . ), i.e. x = 
and Xi — > —1, as follows: 

H* = card \ i G {0, . . . , n — 1} : Xk = inf xA . 

i<j<n 

Let also t(k) be the first passage time of x by t(k) = inf{i : x^ = —k} and for n > k, 
when t(k) < oo, define the shifted chain: 

() _(x i+m +k, i£i<n-t(k) 
m [ h \ x m+l . n + x n + k, is n - t(k) < i < n ' u, i, . . . , n , 

which consists in inverting the pre-t(k) and the post-t(k) parts of x and sticking them 
together. 

Lemma 4. For any k > 0, we have almost surely 
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Proof. It is just a consequence of the fact that t(k) is a zero of H x . 



Lemma 5. Let u^ n be a random variable which is uniformly distributed over {0, 1, . . . , k n } 
and independent of S . Under P( ■ | T(k n ) = n), the first passage time T[uk n ) is uni- 
formly distributed over {0, 1, . . . , n}. 

Proof. It follows from elementary properties of random walks that for all k e {0,1,..., k n }, 
under P( ■ | T(k n ) = n), the chain 0T(k n ){S) has the same law as (Si, < i < n). As a 
consequence, for all j & {0,1, ... , n}, 

P(T(k) = j | T(k n ) =n) = P(T(k n -k)=n-j\ T(k n ) = n) , 

which allows us to conclude. ■ 



Lemma 6. The family of processes 

(X br ' n , H br ' n ) , n > 1 

is tight. 

Proof. Let D([0, t\) be the Skorokhod space of cadlag paths from [0, t] to IR. In Lemma 
E]we have proved the weak convergence of (X br ' n , H br,n ) restricted to the space D([0, t}) 
for each t > 0. Therefore, from Theorem 15.3 of [1], it suffices to prove that for all 
5 e (0, 1) and t] > 0, 

lim lim sup P ( sup \Xf' n - X br ' n \ > V , sup \H br ' n - H br ' n \ > r] ) = . (5.24) 

5^0 n ^ +0 o Ys,te[l-<5,1] s,t£[l-8,l] J 

Recall from Lemma[5]the definition of the r.v. Uk n and define V n = inf{£ : X br,n = — -}. 
Since from this lemma, V n is uniformly distributed over {0, 1/n, . . . , 1 — 1/n, 1}, we have 
for any e < 1 — 5, 

P ( sup \X br ' n - X br > n \ > v , sup \H br ' n - H br > n \ > rj ) < e + 5 4 

\s,te[l-(5,l] s,tE[l-8,l] 

P ( V n G [e, 1 - 5], sup \X^' n - X br ' n \ > t], sup \H br ' n - H br ' n \ > r] 

V s,te{l-8,l] s,t6[l-i5,l] 

Now for a cadlag path u> defined on [0, 1] and t e [0, 1], define the shift: 
al x f u s+t + u, if s < 1 — t r , n1l 

which consists in inverting the paths (u u , < u < t) and (a;„, t < u < 1) and sticking 
them together. We can check on a picture the inclusion: 

{V n G [e, 1 - 5], sup |Xf ' n - Xf'^l > i7, sup \H^ n - H br ' n \ > V } C 
s,te[i-<5,i] s,te[i-5,i] 

{ sup |^(X^),-^(X^),|>r7, sup |^ n (^) t -e y „(^) s |>r7}. 

s,tG[0,l-£] s,te[0,l-e] 
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^From Lemma H] and the straightforward identity in law X br ' n = 9y n (X br ' n ), we deduce 

the two dimensional identity in law (X br > n , H br > n ) = ( y 6 Vn (X br ' n ),9 Vn (H br > n )). Hence 
from the above inequality and inclusion, 



P I sup |Xf' n - X br > n \ > 77, sup \H^ n - H br ' n \ >7 1 \ <e + 5 + 



br,n Tjbr,n I 



s,tG[l-(5,l] 



s,te[l~S,l] 



P | sup \X*> n - X br > n \ > r), sup \H h t r ' n - H br ' n \ > 7] 

s,t6[0,l-e] s,te[0,l-e] 

But from Lemma [3] and Theorem 15.3 in [1], we have 



lim lim sup P sup \X° r ' n - X^' n \ > 77, sup \Hp n - H br ' n \ > r] 



8^0 



s,t6[0,l-e] 



s,te[0,l-e] 



which yields (I5.24p . ■ 

Proof of Theorem [3J Lemma [3] shows that the sequence of processes (X br,n , H br ' n ) 
converges toward (X br , H br ) in the sense of finite dimensional distributions. Moreover 
tightness of this sequence has been proved in Lemma EJ so we conclude from Theorem 
15.1 of [I]. The convergence of the two first coordinates in Theorem [3] is proved, 
i.e. (X br ' n , H br > n ) =>- (X br ,H br ). Then we may deduce the functional convergence of 
the third coordinates from this convergence in law following similar arguments as in 
Theorem 2.4.1 of [9J or in Theorem 3.1 of j8]: 

^From (12. 3p . we can recover the contour process of X br ' n as follows set h = 2i-Hp n , 
for < i < n. For i < n — 1 and t G 

br , n = f (H*' n -(t- if t e [h, b l+1 - l), 

^ 1 (jz£j* - (b i+1 - t))+, if t e [b i+1 - 1, b i+1 ), ■ 



a 



Hence for < i < n — 1, 



sup 

bi<t<b i+ i 



^ybr,n Tjbr,n 



< 



Tjbr,n Tjbr,n 

- H { 



+ 1. 



(5.25) 



Now, we define h n (t) — i, if t e [bi, and i < n — 1, and h n (t) — n, if t G [2n — 2, 2n]. 
The definitions of 6j and /i n implies 



sup 

0<t<2n 



h n (t) 



2 0<fc<n 

Next, we set f n {t) = h n {nt)/n. By (I5.25p . we have 



< - sup ii fc + 1 . 



sup — 

0<t<2 71 



s~ibr,n Trbr,n 
U nt/2 ~~ U nf n (t) 



< — sup 

n o<t<i 



Trbr,n Tjbr,n 
n [nt]+l ~ n [nt] 



+ — , 

n 



and 



sup 

0<t<2 



/»(*) " 2 



1 frbr,n . 1 

^ " SU P — -"fc + ~- 



2a n o<fc<n w 
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^From our hypothesis, we get 



— sup 

n o<*<i 



br,n 
nt]+l 



br,n 
nt] 



H > as n — > oo, 

n 



and 



° n frbr.n , 1 n 

— sup — -H. H ► as n — > oo, 



2a n 0<fc<n 7i P 

in probability. Hence, from the convergence of (X fer ' n , H br,n ) to (X br ,H br ), from The- 
orem 4.1 in [1] and Skorokhod representation theorem we obtain the convergence, 



Remarks: By a classical time reversal argument, the weak convergence of the first 
coordinate in Theorem [3] implies the main result of Bryn- Jones and R.A. Doney [5]. 
Indeed, when X is the standard Brownian motion, it is well known that the returned 
first passage bridge (s + X^_ u , < u < 1) is the bridge of a three dimensional Bessel 
process from to s with length 1. Similarly, the returned discrete first passage bridge 
whose law is this of (k n + S n -i, < % < n) under P( ■ | T(k n ) = n) has the same law 
as (Si, < i < n) given S n = k n and conditioned to stay positive. Then integrating 
with respect to the terminal values and applying Theorem [3] gives the result contained 
in 0. 
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